Deit Base Distilled Patch16 384
Apache-2.0
A distilled vision transformer model, pre-trained at 224x224 resolution and fine-tuned on ImageNet-1k at 384x384 resolution, learning from a teacher model via distillation tokens.
Image Classification
Transformers